Python Job: Production Reliability & Support Expert (SRE)

Location

Montreal, Quebec - Canada

Job type

Full-Time

Python Job Details

Only candidates local to Canada are eligible.

Level 4
Job Description:


Years of experience: 3 to 5
Cyber Data Risk & Resilience Identity & Access Management


About us:
We are a leading global financial services firm operating in 43 countries and a world leader in Investment Banking, Securities, Investment Management and Wealth Management services. Our mission is to provide our clients with best-in-class products and innovative solutions to help them achieve their goals. As part of Access Management, we partner with our clients in technology and the business to manage controls and processes that ensure that personnel and systems are granted access only to the assets necessary to perform their function, and that access is revoked once it is no longer required.

What would I be working on? We are growing our team globally. It's a unique opportunity to work on leading-edge projects that leverage the latest technologies, such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by developing a deep understanding of how our application code is running, configured, and scaled. This allows us to resolve open incidents in the shortest possible time, develop monitors to detect future occurrences, and implement automation technologies that enable the environment to self-heal. Our team manages all entitlements and access in Production, covering more than 35 systems and users distributed around the world.


Role and Responsibilities:
Ensure Production Management is closely aligned/embedded in the Agile software development process and our code meets production standards
Incorporate System Reliability Engineering and DevOps implementations into the day-to-day role by developing automated solutions to long-standing problems, ensuring minimal downtime and manual effort
Configure application monitors using industry-standard monitoring tools, and develop customized monitoring solutions
Build extensive business and application knowledge required for supporting client facing applications
Revisit SRE metrics and confirm them against the firm and department goals
Implement tooling / create automations to help with Toil Elimination (manual or repetitive work)
Engage early in SDLC with our Development teams to have an active role in creating a resilient and reliable solution
Prioritize project work based on critical incidents and the needs of key business stakeholders
Interface with clients and other technology teams to provide governance and control around the production environment.


Qualifications: You should apply to this requisition if you have, at minimum, the following profile:
Bachelor's degree in Computer Science or a related field
Experience with Service Oriented Architecture, Distributed Systems, Business Intelligence Reporting such as Power BI, Scripting such as Python or shell, Front-end development (HTML, JavaScript, AngularJS), Cloud Computing such as Microsoft Azure, and SaaS integrations
Clear understanding of Logging, Monitoring, and Knowledge Management practices such as Docs as Code
Ability to manage an incident call and coordinate multiple teams towards a common goal of resolving a business impactful outage, once trained
Strong knowledge of DevOps and SRE principles, with a grasp of the tools and approaches used to apply them
Strong infrastructure knowledge in Linux / Unix admin, Storage, Networking and Web Technologies
Advanced Unix Shell / Python scripting experience
Advanced SQL knowledge; experience with Sybase, DB2, MongoDB, and Snowflake preferred.


We are an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can reach their full potential. Our skilled and creative workforce is composed of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.